Mapping biomedical terminologies using natural language processing tools and UMLS: mapping the Orphanet thesaurus to the MeSH
Authors
Tayeb Merabti, MS (1, 2); Michel Joubert, PhD (2); Thierry Lecroq, PhD (1); A. Rath, MD (3); Stefan J. Darmoni, MD, PhD (1)
1 CISMeF, University Hospital, Rouen, France & TIBS, LITIS EA 4108, Institute of Biomedical Research, University of Rouen, France
2 LERTIM EA 3283, Faculty of Medicine, Marseille, France
3 Orphanet INSERM SC11, Paris, France
Abstract
Background: Orphanet aims to provide rare disease information to healthcare professionals, patients, and their relatives.
Objective: This work evaluates two methods for mapping the Orphanet thesaurus to the MeSH thesaurus: (1) mapping based on the UMLS and on manual Orphanet-ICD-10 links, and (2) string-based matching.
Results: On a corpus of 375 mappings, string-based matching gave significantly better results than the mapping based on the UMLS and manual Orphanet-ICD-10 links.
Conclusion: String-based matching could be applied to any French-language biomedical terminology not yet included in the UMLS.
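The abstract does not describe how the string-based matching works, so the following is only a minimal sketch of one common approach, not the authors' implementation. It assumes that two labels match when they are identical after lowercasing, accent stripping, punctuation removal, and token sorting. The function names, the identifiers (`SRC-1`, `TGT-1`, `TGT-2`), and the labels are hypothetical toy data, not real Orphanet or MeSH records.

```python
import re
import unicodedata


def normalize(term: str) -> str:
    """Return a normalized key for a label (assumed normalization, see lead-in)."""
    # Decompose accented characters so the accents become separate combining marks.
    decomposed = unicodedata.normalize("NFKD", term.lower())
    # Drop the combining marks, e.g. "behçet" becomes "behcet".
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Keep alphanumeric tokens only, then sort them so word order is ignored.
    tokens = re.findall(r"[a-z0-9]+", no_accents)
    return " ".join(sorted(tokens))


def map_terms(source_terms: dict, target_terms: dict) -> dict:
    """Map each source id to the target ids whose label has the same normalized key."""
    index = {}
    for target_id, label in target_terms.items():
        index.setdefault(normalize(label), []).append(target_id)
    return {
        source_id: index.get(normalize(label), [])
        for source_id, label in source_terms.items()
    }


# Toy data with hypothetical identifiers and labels.
source = {"SRC-1": "Maladie de Behçet"}
target = {"TGT-1": "Behcet, maladie de", "TGT-2": "Syndrome de Marfan"}
mappings = map_terms(source, target)
```

In this toy run, `mappings["SRC-1"]` contains `TGT-1` because both labels reduce to the same sorted, accent-free token sequence. A real mapping would typically also need stop-word handling, stemming, and synonym lists, which this sketch leaves out.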
Similar resources
Mapping the ATC classification to the UMLS metathesaurus: some pragmatic applications.
The ATC classification is a WHO international classification used to classify drugs. The aim of this paper is to evaluate two lexical methods, in English and in French, to map ATC to UMLS. Several applications have been implemented to illustrate the use of the ATC mapping in English and French: (a) MeSH translation in Norwegian, (b) Drug Information Portal, and (c) ATC to PubMed tool. Two lexical meth...
Enhanced LexSynonym Acquisition for Effective UMLS Concept Mapping
Concept mapping is important in natural language processing (NLP) for bioinformatics. The UMLS Metathesaurus provides a rich synonym thesaurus and is a popular resource for concept mapping. Query expansion using synonyms for subterm substitutions is an effective technique to increase recall for UMLS concept mapping. Synonyms used to substitute subterms are called element synonyms. The completen...
Towards linking patients and clinical information: detecting UMLS concepts in e-mail
The purpose of this project is to explore the feasibility of detecting terms within the electronic messages of patients that could be used to effectively search electronic knowledge resources and bring health information resources into the hands of patients. Our team is exploring the application of the natural language processing (NLP) tools built within the Lister Hill Center at the National L...
Enhancing LexSynonym Features in the Lexical Tools
Concept mapping is vital to natural language processing (NLP) for bioinformatics. Query expansion using synonyms for subterm substitutions is an effective technique to increase recall when no direct concept mapping can be found through normalization. For example, no concept can be found by direct mapping through normalization if the source vocabulary is “calcaneal fracture”. By substituting the...
Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program
The UMLS Metathesaurus, the largest thesaurus in the biomedical domain, provides a representation of biomedical knowledge consisting of concepts classified by semantic type and both hierarchical and non-hierarchical relationships among the concepts. This knowledge has proved useful for many applications including decision support systems, management of patient records, information retrieval (IR...
Publication date: 2010